Model-Selection for Non-parametric Function Approximation in Continuous Control Problems: A Case Study in a Smart Energy System

نویسندگان

  • Daniel Urieli
  • Peter Stone
چکیده

This paper investigates the application of value-function-based reinforcement learning to a smart energy control system, specifically the task of controlling an HVAC system to minimize energy while satisfying residents’ comfort requirements. In theory, value-function-based reinforcement learning methods can solve control problems such as this one optimally. However, since choosing an appropriate parametric representation of the value function turns out to be difficult, we develop an alternative method, which results in a practical algorithm for value function approximation in continuous state-spaces. To avoid the need to carefully design a parametric representation for the value function, we use a smooth non-parametric function approximator, specifically Locally Weighted Linear Regression (LWR). LWR is used within Fitted Value Iteration (FVI), which has met with several practical successes. However, for efficiency reasons, LWR is used with a limited sample-size, which leads to poor performance without careful tuning of LWR’s parameters. We therefore develop an efficient meta-learning procedure that performs online model-selection and tunes LWR’s parameters based on the Bellman error. Our algorithm is fully implemented and tested in a realistic simulation of the HVAC control domain, and results in significant energy savings.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Novel Qualitative State Observer

The state estimation of a quantized system (Q.S.) is a challenging problem for designing feedback control and model-based fault diagnosis algorithms. The core of a Q.S. is a continuous variable system whose inputs and outputs are represented by their corresponding quantized values. This paper concerns with state estimation of a Q.S. by a qualitative observer. The presented observer in this pape...

متن کامل

Joint Allocation of Computational and Communication Resources to Improve Energy Efficiency in Cellular Networks

Mobile cloud computing (MCC) is a new technology that has been developed to overcome the restrictions of smart mobile devices (e.g. battery, processing power, storage capacity, etc.) to send a part of the program (with complex computing) to the cloud server (CS). In this paper, we study a multi-cell with multi-input and multi-output (MIMO) system in which the cell-interior users request service...

متن کامل

Vibration Response of an Elastically Connected Double-Smart Nanobeam-System Based Nano-Electro-Mechanical Sensor

Nonlocal vibration of double-smart nanobeam-systems (DSNBSs) under a moving nanoparticle is investigated in the present study based on Timoshenko beam model. The  two  smart  nanobeams (SNB) are  coupled  by  an  enclosing  elastic  medium  which  is  simulated  by  Pasternak foundation. The energy method and Hamilton’s principle are used to establish the equations of motion. The detailed param...

متن کامل

Optimal Energy Procurement of Smart Large Consumers Incorporating Parking Lot, Renewable Energy Sources and Demand Response Program

Large commercial and industrial loads known as large energy consumers are always seeking to reduce their energy costs and consequently they are utilizing renewable and non-renewable energy sources in procurement of their required energy. Use of renewable energy sources (RESs) and plug-in electric vehicles (PHEVs) parking lot without proper planning will make technical and economic problems for ...

متن کامل

Designing Decision Maker in a Smart Home for Energy Consumption Optimization Using Fuzzy Modeling

existed electricity grids deliver produced power to the consumer passing through transmission and distribution grids. According to high losses of these grids in transmission level and inexistence of bilateral interaction for simultaneous information exchange, a concept of smart grids were made by capabilities such as consciously participation of consumers in the smart electricity grids, an amou...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013